Goto

Collaborating Authors

 bias mitigation algorithm


Machine Learning Fairness in House Price Prediction: A Case Study of America's Expanding Metropolises

arXiv.org Artificial Intelligence

As a basic human need, housing plays a key role in enhancing health, well-being, and educational outcome in society, and the housing market is a major factor for promoting quality of life and ensuring social equity. To improve the housing conditions, there has been extensive research on building Machine Learning (ML)-driven house price prediction solutions to accurately forecast the future conditions, and help inform actions and policies in the field. In spite of their success in developing high-accuracy models, there is a gap in our understanding of the extent to which various ML-driven house price prediction approaches show ethnic and/or racial bias, which in turn is essential for the responsible use of ML, and ensuring that the ML-driven solutions do not exacerbate inequity. To fill this gap, this paper develops several ML models from a combination of structural and neighborhood-level attributes, and conducts comprehensive assessments on the fairness of ML models under various definitions of privileged groups. As a result, it finds that the ML-driven house price prediction models show various levels of bias towards protected attributes (i.e., race and ethnicity in this study). Then, it investigates the performance of different bias mitigation solutions, and the experimental results show their various levels of effectiveness on different ML-driven methods. However, in general, the in-processing bias mitigation approach tends to be more effective than the pre-processing one in this problem domain. Our code is available at https://github.com/wahab1412/housing_fairness.


Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML

arXiv.org Artificial Intelligence

With fairness concerns gaining significant attention in Machine Learning (ML), several bias mitigation techniques have been proposed, often compared against each other to find the best method. These benchmarking efforts tend to use a common setup for evaluation under the assumption that providing a uniform environment ensures a fair comparison. However, bias mitigation techniques are sensitive to hyperparameter choices, random seeds, feature selection, etc., meaning that comparison on just one setting can unfairly favour certain algorithms. In this work, we show significant variance in fairness achieved by several algorithms and the influence of the learning pipeline on fairness scores. We highlight that most bias mitigation techniques can achieve comparable performance, given the freedom to perform hyperparameter optimization, suggesting that the choice of the evaluation parameters-rather than the mitigation technique itself-can sometimes create the perceived superiority of one method over another. We hope our work encourages future research on how various choices in the lifecycle of developing an algorithm impact fairness, and trends that guide the selection of appropriate algorithms.


Toward Mitigating Sex Bias in Pilot Trainees' Stress and Fatigue Modeling

arXiv.org Artificial Intelligence

While researchers have been trying to understand the stress and fatigue among pilots, especially pilot trainees, and to develop stress/fatigue models to automate the process of detecting stress/fatigue, they often do not consider biases such as sex in those models. However, in a critical profession like aviation, where the demographic distribution is disproportionately skewed to one sex, it is urgent to mitigate biases for fair and safe model predictions. In this work, we investigate the perceived stress/fatigue of 69 college students, including 40 pilot trainees with around 63% male. We construct models with decision trees first without bias mitigation and then with bias mitigation using a threshold optimizer with demographic parity and equalized odds constraints 30 times with random instances. Using bias mitigation, we achieve improvements of 88.31% (demographic parity difference) and 54.26% (equalized odds difference), which are also found to be statistically significant.


Practical Bias Mitigation through Proxy Sensitive Attribute Label Generation

arXiv.org Artificial Intelligence

Machine Learning has attained high success rates in practically Similarly, zip codes can be correlated with race. Hence, every field, including healthcare, finance, and education, the bias gets embedded in the non-sensitive attributes that based on the accuracy and efficiency of the model's are used in the model training. Based on this hypothesis, a outcome (Dastile, ร‡elik, and Potsane 2020; Bakator and few initial efforts have been made to mitigate bias in the Radosav 2018). However, these models are biased and exhibit absence of protected attributes (Grari, Lamprier, and Detyniecki a propensity to favor one demographic group over another 2022; Lahoti et al. 2020; Yan, Kao, and Ferrara in various applications, including credit and loan approval, 2020; Zhao et al. 2022). The most recent approach (Zhao criminal justice, and resume-based candidate shortlisting et al. 2022) identifies related features that are correlated with (Mehrabi et al. 2021; Gianfrancesco et al. 2018; Yapo the sensitive attributes and would further minimize the correlation and Weiss 2018). The idea of fairness has received a lot of between the related features and the model's prediction attention recently to combat the discrimination from the outcome to learn a fair classifier with respect to the sensitive of ML models (Dwork et al. 2012; Beutel et al. 2017; attribute. However, identification of related features require Hardt, Price, and Srebro 2016).


MEDFAIR: Benchmarking Fairness for Medical Imaging

arXiv.org Artificial Intelligence

A multitude of work has shown that machine learning-based medical diagnosis systems can be biased against certain subgroups of people. This has motivated a growing number of bias mitigation algorithms that aim to address fairness issues in machine learning. However, it is difficult to compare their effectiveness in medical imaging for two reasons. First, there is little consensus on the criteria to assess fairness. Second, existing bias mitigation algorithms are developed under different settings, e.g., datasets, model selection strategies, backbones, and fairness metrics, making a direct comparison and evaluation based on existing results impossible. In this work, we introduce MEDFAIR, a framework to benchmark the fairness of machine learning models for medical imaging. MEDFAIR covers eleven algorithms from various categories, nine datasets from different imaging modalities, and three model selection criteria. Through extensive experiments, we find that the under-studied issue of model selection criterion can have a significant impact on fairness outcomes; while in contrast, state-of-the-art bias mitigation algorithms do not significantly improve fairness outcomes over empirical risk minimization (ERM) in both in-distribution and out-of-distribution settings. We evaluate fairness from various perspectives and make recommendations for different medical application scenarios that require different ethical principles. Our framework provides a reproducible and easy-to-use entry point for the development and evaluation of future bias mitigation algorithms in deep learning. Code is available at https://github.com/ys-zong/MEDFAIR.


Mitigating AI Bias, with โ€ฆBias

#artificialintelligence

This article is part of my Data Trust series of talks and writing. The purpose of these articles are to break down complex but important socio-technical topics in a manner that is accessible to both practitioners and non-practitioners. Most tools we use today leverage AI/ML from the moment we wake up and while we sleep. Humans build Machine Learning, and humans are inherently biased. Since humans aren't perfect, we encode our biases into the data we use to train AI.


On the Basis of Sex: A Review of Gender Bias in Machine Learning Applications

arXiv.org Artificial Intelligence

Machine Learning models have been deployed across almost every aspect of society, often in situations that affect the social welfare of many individuals. Although these models offer streamlined solutions to large problems, they may contain biases and treat groups or individuals unfairly. To our knowledge, this review is one of the first to focus specifically on gender bias in applications of machine learning. We first introduce several examples of machine learning gender bias in practice. We then detail the most widely used formalizations of fairness in order to address how to make machine learning models fairer. Specifically, we discuss the most influential bias mitigation algorithms as applied to domains in which models have a high propensity for gender discrimination. We group these algorithms into two overarching approaches -- removing bias from the data directly and removing bias from the model through training -- and we present representative examples of each. As society increasingly relies on artificial intelligence to help in decision-making, addressing gender biases present in these models is imperative. To provide readers with the tools to assess the fairness of machine learning models and mitigate the biases present in them, we discuss multiple open source packages for fairness in AI.


Hidden Technical Debts for Fair Machine Learning in Financial Services

arXiv.org Artificial Intelligence

The recent advancements in machine learning (ML) have demonstrated the potential for providing a powerful solution to build complex prediction systems in a short time. However, in highly regulated industries, such as the financial technology (Fintech), people have raised concerns about the risk of ML systems discriminating against specific protected groups or individuals. To address these concerns, researchers have introduced various mathematical fairness metrics and bias mitigation algorithms. This paper discusses hidden technical debts and challenges of building fair ML systems in a production environment for Fintech. We explore various stages that require attention for fairness in the ML system development and deployment life cycle. To identify hidden technical debts that exist in building fair ML system for Fintech, we focus on key pipeline stages including data preparation, model development, system monitoring and integration in production. Our analysis shows that enforcing fairness for production-ready ML systems in Fintech requires specific engineering commitments at different stages of ML system life cycle. We also propose several initial starting points to mitigate these technical debts for deploying fair ML systems in production.


Tackling Bias and Explainability in Automated Machine Learning

#artificialintelligence

Automated machine learning is likely to introduce two critical problems. Fortunately, vendors are introducing tools to tackle both of them. Adoption of automated machine learning -- tools that help data scientists and business analysts (and even business users) automate the construction of machine learning models -- is expected to increase over the next few years because these tools simplify model building. For example, in some of the tools, all the user needs to do is specify the outcome or target variable of interest along with the attributes believed to be predictive. The automated machine learning (autoML) platform picks the best model.


Machine learning and bias

#artificialintelligence

Bias is a prejudice in favor or against a person, group, or thing that is considered to be unfair. But as machine learning becomes more of an integral part of our lives, the question becomes will it include bias? In this article, I'll dig into this question, its impact, and look at ways of eliminating bias from machine learning models. Machine learning has shown great promise in powering self-driving cars, accurately recognizing cancer in radiographs, and predicting our interests based upon past behavior (to name just a few). But with the benefits from machine learning, there are also challenges.